Dynamic Load Sharing with Unknown Memory Demands in Clusters

نویسندگان

  • Songqing Chen
  • Li Xiao
  • Xiaodong Zhang
چکیده

A compute farm is a pool of clustered workstations to provide high per$ormance computing services for CPU-intensive, memory-intensive, and I/O active jobs in a batch mode. Existing load sharing schemes with memory considerations assume jobs’ memory demand sizes are known in advance or predictable based on users’ hints. This assumption can greatly simplify the designs and implementations of load sharing schemes, but is not desirable in practice. In order to address this concern, we present three new results and contributions in this study ( I ) Conducting Linux kernel instrumentation, we have collected different types of workload execution traces to quantitatively characterize job interactions, and modeled page fault behavior as a function of the overloaded memory sizes and the amount of jobs’ I/O activities. ( 2 ) Based on experimental results and collected dynamic system information, we have built a simulation model which accurate1.y emulates the memory system operations and job migrations with virtual memory considerations. (3) We have proposed a memorycentric load sharing scheme and its variations to effectively process dynamic memory allocation demands, aiming at minimizing execution time of each individual job by dynamically migrating and remote1.y submitting jobs to eliminate or reduce page faults and to reduce the queuing time for CPU services. Conducting trace-driven simulations, we have examined these load sharing policies to show their effectiveness.

برای دانلود متن کامل این مقاله و بیش از 32 میلیون مقاله دیگر ابتدا ثبت نام کنید

ثبت نام

اگر عضو سایت هستید لطفا وارد حساب کاربری خود شوید

منابع مشابه

Dynamic Cluster Resource Allocations for Jobs with Known and Unknown Memory Demands

ÐThe cluster system we consider for load sharing is a compute farm which is a pool of networked server nodes providing high-performance computing for CPU-intensive, memory-intensive, and I/O active jobs in a batch mode. Existing resource management systems mainly target at balancing the usage of CPU loads among server nodes. With the rapid advancement of CPU chips, memory and disk access speed ...

متن کامل

Dynamic Load Sharing With Unknown Memory Demand of Jobs in Clustered Compute Farms

A compute farm is a pool of clustered workstations to provide high performance computing services for CPU-intensive, memory-intensive, and I/O active jobs. Existing load sharing schemes with memory considerations assume jobs’ memory demand sizes are known in advance or predictable based on users’ hints. This assumption can greatly simplify the designs and implementations of memory-centric schem...

متن کامل

Voltage Control and Load Sharing in a DC Islanded Microgrid Based on Disturbance Observer

Increasing DC loads along with DC nature of distributed energy resources (DERs) raises interest to DC microgrids. Conventional droop/non-droop power-sharing in microgrids suffers from load dependent voltage deviation, slow transient response, and requires the parameters of the loads, system and DERs connection status. In this paper, a new nonlinear decentralized back-stepping control strategy f...

متن کامل

Experiences with the Mether-NFS Virtual Shared Memory System

Workstation clusters have recently attracted high interest as a technology providing supercomputer class performance at much lower price levels. The message passing programming model dominates the application development, despite the overhead and the complexity introduced by the explicitly coded synchronisation and data transfers. We give an introduction to the virtual shared memory programming...

متن کامل

Local Cluster First Load Sharing Policy for Heterogeneous Clusters

This paper studies the load sharing problem among heterogeneous cluster systems. The heterogeneous clusters we consider are time-sharing, and the computers in these clusters have different CPU powers and memory capacities. Load sharing means even workloads among all coordinated computers in the system. As some nodes suffer from high loading, it is necessary to migrate some jobs to the nodes wit...

متن کامل

ذخیره در منابع من


  با ذخیره ی این منبع در منابع من، دسترسی به آن را برای استفاده های بعدی آسان تر کنید

برای دانلود متن کامل این مقاله و بیش از 32 میلیون مقاله دیگر ابتدا ثبت نام کنید

ثبت نام

اگر عضو سایت هستید لطفا وارد حساب کاربری خود شوید

عنوان ژورنال:

دوره   شماره 

صفحات  -

تاریخ انتشار 2001